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Abstract 

In this paper we offer a metric similar to graph edit distance which measures the dis- 
tance between two (possibly infinite)weighted graphs with finite norm (we define the norm 
of a graph as the sum of absolute values of its edges) . The main result is the completeness 
of the space. Some other analytical properties of this space are also investigated. 

1 Introduction 

Many objects can be demonstrated with weighted graphs. In any collection of objects of similar 
nature a way to quantify the difference between objects may be desired (For instance if we were 
to select the most similar objects to a given object from a database). In the theoretical side 
one common way is to develop a metric on the space of objects in demand. One way to build 
a metric, is to define some operations that transform the members of the space to one another, 
and assign a cost to each operation then define the distance between two objects to be the 
minimum cost that must be payed to transform the first object to the second via a sequence 
of the defined operations. Such metrics sometimes are referred to as "Edit distance". Two 
examples of them are the "Levenshtein edit distance" [Ij on strings and "Graph edit distance" 
[2] on the space of finite graphs. This paper extends the Graph edit distance to the space of 
"countable weighted graphs with finite norm" and investigates some topological properties of 
the space. 



2 Priliminaries and intuitive examples 

In this chapter, we introduce the concepts intuitively. The main question here is "given two 
graphs, how much they differ?" . Based on this question we could define different distances, we 
choose here "Graph edit distance" and we generalize it to infinite graphs. 

Example 1. Consider the following two graphs 
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One can transform G\ to G2, by adding a vertex and an edge to Gi and deleting an edge from 
it 




Given two graphs, it is possible to transform one to the other by addition and deletion of some 
vertices and edges. The minimum number of edge addition and deletions in such a process 
is the distance between the two graphs and is denoted by d(Gi,G2)- In the above example 
d(Gi, G2) = 2, because we added an edge and deleted one. It is clear that if two graphs differ 
only in isolated vertices, then by this definition their distance is zero. 

Example 2. Consider the following weighted graphs 




We transform Gi to G2 as follows 
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The right-hand graph, after removing the zero weighted edge and it's endpoints is same as G2- 
We define the distance between two edge-weighted graphs to be the minimum amount of edge- 
weight modifications required to transform one to the other (zero-weighted edges and isolated 
vertices could be added and deleted for free) , in this example d(G'i, G2) = |1| + |1| + |— 2| = 4. 



Example 3. We construct the sequence {Gn} as follows 



Gi : 1 



G2 



Ifn<m then clearly 

m ^ 

d(Gn,G^)< ^ 

t=n+l 

hence {G^} is a Cauchy sequence. On the other hand {Gn} does not approach to a finite graph 
(to prove that let G be a graph with m edges, and show that for each n > m, d(G„, G^) > 
l/{m + l)), implying that the space of finite weighted graphs is not complete. But this sequence 
approaches to the following infinite graph 
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Example 4. Consider the following two graphs 
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G is obtained from G by removing the joints. You should be able to verify the following inequality 
for arbitrary graphs intuitively. 

d{G,H) < d{G,H) 

3 Developing the metric mathematically 

We first define the distance between two labeled graphs and then define unlabeled graphs as 
the equivalence classes of labeled graphs. The main result in this chapter is to show that the 
introduced distance provides us a metric space. 

Definition 1. Fix V = {vi,V2,V3, ■ ■ ■} as the vertex set. We show the set of all 2'element 
subsets of V by E (edge set). A labeled graph is a function w : E W. The value w{e), is 
the weight of e(inw). A nonzero edge (of w), is one which it's weight is nonzero. An isolated 
vertex of w is a vertex that has no nonzero edge attached, w is standard if it has infinitely 
many isolated vertices. We denote by W the set of all labeled graphs, and by the set of all 
standard labeled graphs. Forw,w' G W, the norm ofw, the distance between w andw' and the 
sets Wq and Wq are defined as follows 

1^1 =EeeEk(e)l 

d(u', w') — \w — w'\ 
Wo = \ \w\< ooj 

W is a vector space and Wq is a normed subspace of it which, under metric d is isometric to 
/i(M). w,w' E W are isomorphic(w ~ w'), when there exists a bijection f : V V such that 
for every edge uv G E, w{uv) = w'{f{u), f{v)). 

We use standard graphs in constructing unlabeled weighted graphs, it has several benefits, 
in particular with an infinite number of isolated vertices we don't need to delete or add a vertex. 

Definition 2. Let Q = W^ / ~ and Qq = Wq/ ~. The elements of Q are named unlabeled 
graphs. For w G W, we show the equivalence class of w by w and define 

\w\ — \w\ 
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When no confusion can arise we use the term "graph" instead of "labeled graph" and 
"unlabeled graph". A graph with all edge weights equal to 0, simply is denoted by 0. We 
sometimes use common graph theory notions here, converting them to match our definitions is 
not difficult, for example "to delete an edge" means "to change its weight to zero". 

Definition 3. We denote by SiV) the set of all bijections on V . Suppose a G S{V) and 
e — uv & E, let us define a{e) = {(j{u), (j{v)}. Also for w e W, define w"^ &W as follows 



\/eeE w'^iaie)) = w{e) 



or equivalently 



yeeE w^ie) ^w{a-\e)) 



obviously for w,w' &W 



w w' <(=^ 3(7 e S{y) ■.w'^w'' 




Lemma 1. Given w,w' and cr, 7 e S{V), the following equations hold 




— w 



Definition 4. Given two graphs G, G' 



e Q, we define their distance as follows 




Also for w & G we define 




Lemma 2 



For Wi,W2 G 



d{wi, W2) = d('u;i, ^2) = d{wi, W2) 



proof. 



d«,0=d(«;i,«;2" 




{d«,0 I (71,(72 e S{V)] = {dK,w?) I a e S{V))] 
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w{e) e e E' 
otherwise 



Lemma 3. For wi, W2, tfs G 

d{wi, ws) < d{wi, W2) + d(w2, w^) 

proof. 

d(wi, W2) + d(w2, W3) = d(wi, W2) + d(w2, W3) > d(wi, W3) 

■ 

Definition 5. Le^ w e W^, >1 C R and E' C l^e define graphs Cut(w, A), Cut(w, E') e 
as follows 

^ ' I otherwise 

^/lis means to delete all edges with weights outside of A 

Cut{w,E'){e) = 

also for e > 

Cut('u;, e) — Cut('u;, (—00, — e] U [e, 00)) 
moreover for w e we have two more definitions 

CvX{w , A) = Cxit{w , A) and Cut(i/;, e) = Cut('u;, e) 

Definition 6. The spectrum of a graph is the set of all of it's edge weights 

Spec('u;) = Spec('u;) = |w(e) | e e E^ 

Lemma 4. If w & Wq then Spec(w) is a countable and compact set, furthermore the only 
possible limit point of it is 0. 

Lemma 5. IfG,He Qo and d(G, H) = Q then for every e > 0, Cut(G, e) = Cut(i7, e). 
proof. Let 

^4 = — I X e Spec(G), y e Spec(i?), x ^ y, {\x\ > e or \y\ ^ ^)| 

If ^4 = then Cut(G', e) = = Cut (if, e), otherwise we can define 

5 — min A 

Clearly 5 > 0. Choose w e G and w' e H such that \w — w'\ < 5. For e e E, \w{e) — w'{e)\ < 5, 

and two cases are possible 

case i: \w{e) \ , \w'{e)\ < e, which implies 

Cut(w, e)(e) = = Cut(w', e)(e) 



case ii: w{e) > e or w'{e) > e, in this case w{e) = w'{e) because otherwise \w{e) — w'{e)\ e A, 
and hence S < S which is a contradiction. Thus 

Cut(w, e) (e) — w{e) — w\e) — Cut(w', e) (e) 

therefore in each case, Cut(w, e) = Cut(u'', e) and consequently Cut(G, e) = Cut(if, e). ■ 

In the following lemma which is known as Konig infinity lemma, please forget our notion of 
a graph, just take it as in ordinary graph theory texts. 

Lemma 6. Ai, A2, A3, • • • are nonempty, finite and disjoint sets, and G is a graph with 
yj^^iAn as vertex set, such that for every n, every vertex in An+i has a neighbour in An- G 
contains a ray 010,2^3 ■ ■ ■ with a„ G (A ray is a sequence of different vertices each of which 
adjacent to it's successor) 

Theorem 1. if w, w' e Wq and d('u;, w') — then w — w' , and consequently d is a metric on 

proof. Assume that w„ = Cut(w, ^) and w!^ = Cut(w', ^). The above lemma implies that 
Wn ^ w'^. Denote by C/„ and the sets of non-isolated vertices of Wn and w!^, and by A^ the 
set of all pairs (n, /) in which / is an isomorphism between nonzero parts of Wn and w'^ 

An = {{n, f)\f : t/n ^ Vm, V e Ur,{u ^v^ Wn{uv) = w'^{f{u), f{v)))} 

Since Wn ~ w'^ and C/„, are finite, A^ is nonempty and finite. Define a graph with vertex set 

U~^^ A„ and edge set |{(?7,, f),{n + 1, g)} \ f Q grj. Consider {n + l,g) e A^+i. Let / be the 

restriction of g to Un, it is easily seen that (n, /) e An and (n, /) is a neighbor of (n + 1, g) so 
each vertex in An+i has a neighbour in An- Then according to Konig infinity lemma, there is 
an infinite sequence (1, /i), (2, /2), (3, /s), ■ ■ ■ of vertices such that for each n, the vertex (n, /„) 
is adjacent to the vertex (n + 1, /„+i), i.e /i C /a C /g C • • • . We put / = |J^^ /„• / is an 
isomorphism between nonzero parts of w and w'. Since both w and w' have a countable number 
of isolated vertices, / can be extended to an isomorphism between w and w'. ■ 

4 Completeness of Qq 

In this chapter and the next one we try to find some topological properties of Qq. The main 
result in this chapter is the completeness of Qq. 

Theorem 2. Let G, Gi, G2, • " " £ Go- The following are equivalent 

1. Gn^G 

2. for every w E G there is a sequence Wn G Gn such that Wn ^ w 

3. there is a w & G and a sequence Wn e Gn such that Wn ^ w 
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proof. (1) =^ (2): Let w e G, since 

d(G„, G) = dlG,, w) = inf jdK, ^) I ^' e Gn} 

the elements G (j„ exist such that d{wn,w) < d{Gn,G) + -, therefore Wn w. (2) =^ (3) 
is evident 

(3) (1) It follows from the inequality 

d(Gn,G)<d(wn,w) 



Theorem 3. // the sequence C be convergent to a graph in Wo, then the sequence 

{wn} is convergent in Qq. 

proof. Suppose Wn ^ w, we shall define the graphs w'^ and w' by means of the following 
equations 



„/ „\- f ^nivi/2Vj/2) i,J are even 

Wif^j) - I otherwise 

w'(v v ) = I ^(^V2^^i/2) hJ are even 

^ ^ 1^ otherwise 



we simply observe that w' is a standard graph and Wn ~ w'^ and d{w'^, w') = d{wn, w) so 
w'n — > w', therefore w'^ — > w' and accordingly Wn — w'. ■ 

Definition 7. Suppose that w,w' e W. w is a subgraph of w' (w ^ w') when 

MeeE (w{e) = V ^(e) = ^'(e)) 

also we say, G is a subgraph of G' (G ^ G' ) where G, G' e Q, if one of these equivalent 
statements holds 

3weG 3w' eG' (w ^ w'^ 
Vw e G 3w' eG' (w ^ w') 
Vw' eG'3weG (w di w') 

Clearly if w, w' e Wq and w -< w' then \w\ < \w'\. Also -< is a partial order on W . 
Theorem 4. -< is a partial order on Qq. 

proof. The transitive and reflexive properties are consequences of the similar properties in W . 
The proof of the antisymmetric property: If this property fails, then G ^ G' ^ G holds for 
some G,G' e Qo, implying |G| < < which is impossible. ■ 

Theorem 5. IfG, G' e Qq, then 
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1. d(G',G") > IIGI - |G"|| 

2. IfGdi G' then d(G, G') = \G'\ - \G\ 
proof. 1) For each w E G and w' G G' we have 

\w-w'\ >\\w\- \w'\ \ = - 

so d{G,G') > \\G\ - \G'\\. 

2) Choose w E G and w' e G' such that w ^w' , clearly 

\w' — w\ = \w'\ — \w\ = \G'\ — \G\ 

which in combination with (1) gives the result. ■ 

If (ji :< G2 G3 ^ ■ ■ • is an increasing sequence in Q, then there exists an increasing 
sequence wi ^ W2 ^ ^ ■ ■ ■ such that for every n E N, Wn E Gn- It is enough to select wi 
from Gi then construct other terms inductively. 

Theorem 6. Suppose that w,wi,W2,W3,--- G Wo, Wn — ^ w and {an} is a sequence of 
nonnegative real numbers converging to zero, it follows that Cut('u;„, a„) w. 

proof. We take e > and select 5 > satisfying Cut(w, [—5, 5\) < e and choose Ni e N such 
that 

Vn > iVi a„ < 5 

we also set ^ = |e G -B I \w{e)\ > 5^. Since A is finite, we can select N2 and A^3 in such a way 
that 

Vn>A^2VeGA \wn{e)\ > 5 
Vn > A^3 \wn — w\ < e 

Now for n > max{7Vi, N2,Ns} 

|CutK,a„)| > |CutK,^)| > \Cut{w,A)\-\Cut{w-Wn,A)\ 

> \w\ — |Cut(w, [—6, 6])\ — |w — Wn\ > \w\ — e — e = \w\ — 2e 

\W - Cut{Wn,an)\ <\W- Wn\ + \Wn - Cut{Wn,an)\ 

< e + |tt;„| — |Cut(u'„, a„)| < e + \w\ + e — {\w\ — 2e) = 4e 

■ 

Theorem 7. In Wq (or Qq), any increasing hounded sequence(with respect to z^) is convergent. 
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proof. Suppose that is an increasing bounded sequence in Wq. We define the graph w 

as follows 

w{e) = lim Wn{e) 

n— >oo 

The sequence Wn{e) is ultimately constant, so the limit exists. It is easily seen that 

\w\ = lim \wn\ < oo =^ w e Wq 

n— >oo 

Given the fact that w„ ^ w for any n, we have 

lim d{w,Wn) — lim {\w\ — |w„|) = 

n—*oo n— ♦oo 

therefore ^ w. 

Now, suppose {Gn} is an increasing bounded sequence in Qq. Corresponding to this sequence, 
there is an increasing sequence {wn} Q Wq such that w„ e Gn- The convergence of {Gn} is a 
result of the convergence of ■ 

Definition 8. A graph in which no two nonzero edges are adjacent is called a jointless one. 
We denote by Q and Qq the sets of jointless graphs in Q and Qq. Given a graph G e Qq, there 
is a unique member of which has the same edge weights as G (with same multiplicity), we 
denote it by G, see example 4- 

Definition 9. Suppose G G G and a G Spec(G). The unique graph obtained from G by 
deleting an edge with weight a is denoted by Del(G, a) . 

Definition 10. Suppose w = G E Qq. We define Code(w) = Code(G) = / where f e is 
constructed by induction as follows: set Gq — G and define 

„, ^ f max Spec(Gn-i) n is odd „ „ „, 

= I min Spec(G:_0 n is even «" = Del(G„_„ /(„)) 

Example 5. Suppose that G is the jointless graph that has an edge with weight ^ for each 
n>l and two edges of weight —1 , then 

Code(G) = (l,-l,^,-l,l,0,^,0,---) 

Lemma 7. Suppose G,H E Qo, xi = maxSpec(G') and X2 = maxSpcc(i/) (or Xi — 
minSpec(G) and X2 = minSpec(i7) ) and G' = Del{G,Xi) and H' = Del{H,X2), then 

d{G,H)^d{G',H') + \xi-X2\ 
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proof. It is clear that d{G,H) < d{G',H') + \xi — X2\. To show the inverse, we take wi ^ G 
and W2 e H. Let Xi = Wi{ei), X2 = ^2(62), yi = 1/^1(62) and 7/2 = ^2(61) and define the graph 
w as follows 

{xi e = 62 

yi e = ei 

Wi{e) otherwise 

according to the assumption, yi < Xi and 7/2 < 2:2 (or Xi < yi and X2 < y2 )■ Using these 
relations one can easily show that 

d{wi,W2) - d{w,W2) = l^i - y2\ + \x2 - yi\ - \xi - X2\ - \y2 -yi\>0 
=^ d{wi, W2) > d{w, W2) > |a;i - a;2| + d(G', H') 

■ 

Theorem 8. For every G,H e Qq we have 

d{G,H) = ICode(G') - Code{H)\ 

proof. Let (?„ be the sequence related to graph G in definition 10, and relate a similar sequence 
Hn to H. We set Code(G) = g and Gode{H) = h. Applying the preceding lemma n times, we 
obtain 

n 

d{G, H) = Y, \g{n) - h{n)\ + d(G„, H^) 

i=l 

Since the sequences and \Hn\ are convergent to 0, d(G„,if„) — > and consequently 
d{G,H) = J2 lain) - h{n)\ = |Code(G) - Code{H)\ 

i=l 

■ 

Lemma 8. Qq is a complete subspace of Qq 

proof. Set = I Code(G) | G e it is easily seen that 



B 



^{feh\yn (/(2n - 1) > 0, f{2n) < 0, 
/(2n - 1) > /(2n + 1), f{2n) < f{2n + 2)) } 



and B is a closed subset and consequently a complete subset of /i(]R). According to the theorem 
8, Code : ^0 — ^ -B is an onto isometry, so Qq is also complete. ■ 

By a correspondance between two graphs Gi,G2 £ Q we mean a choice of two members 
wi e Gi and W2 G G2- 

Theorem 9. Qq is a complete metric space 
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proof. Suppose that Gn is a Cauchy sequence. Since d((j„, Gm) < d(G„, Gm), so the sequence 
{Gn} is also Cauchy. Therefore according to the preceding lemma, the sequence {Gn} is 
convergent to a jointless graph F. Set A = Spec(F) and let w„ G Gn- Suppose w'^ is obtained 
from Wn by rounding the weight of each edge to the closest number in A (if the weight of an edge 
has the least difference with two numbers in A, we choose one of them arbitrarily). Set Hn = w'n, 
we claim that d{Hn, Gn) 0. In fact, it is easily seen that d((j„, Hn) < d(w„, w'n) < d(G'„, F), 
which proves the claim. Therefore, it is enough to show {if„}'s convergence instead of {Gn j's. 
First let us show that for all e > there exists a natural number M such that 

Vm > M Cut(i7„, e) = Cut (//a/, e) (1) 

To prove that we set = |x G A | |x| > e| and 6 = min |d(x, | x G . Since 

A does not have a nonzero limit point and A^ is finite, so 5 > 0. The relation d(if„, Gn) 
shows that {Hn} is a Cauchy sequence so there is a M G N such that d{HM, H^) < S for 
every m > M. Now, if Cut(ifm,e) ^ Cut(ifM,e) for one m > M, in every correspondence 
between H^ and Hm we get an edge which has two different weights in the two graphs, one 
of which from A^, and the other from A, and hence ([{Hm-, Hm) > 5, which is a contradiction. 
Choose a strictly increasing sequence {M„} such that for each ra, e = - and M = Mn satisfy 
the equation ([1]). It is evident that for each n, Cut(ifAf„, ^) ^ Cut(ifjvf„+i5 ;^)- Then, since 
\Cut {Hm„, ^)| < \HmJ, so the sequence {Cut{HM„, ^)} is bounded and consequently converges 
to a graph G. We have 

d{Hn, Gn) < d{Hn, Gn) =^ d(^„, G„) 

^Hn^F^HM^^F^ (2) 
Cut(^M„, Cut(ifM„, ^)) ^ 



Also 



d{HM„,Cnt{HM^,^)) = \HmJ - |Cut(ifM„,i)| 



([2]) and (13]) conclude that 



Cutfi^Mn, -r 



d(^M„,Cut(^M„,^)) 



(3) 



d{HM„,Cnt{HM„,-))^0 
n 



Therefore Hm„ — > G, so Hn has a convergent subsequence and consequently, it is convergent 
itself ■ 



5 Examining the space for some other common topolog- 
ical properties 

Besides the completeness of Qq there are some other important properties of the space, we 
discuss a few of them here. Note that, in this chapter by a finite graph we mean one that has 
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only finitely many non-isolated vertices. Also in a metric space Ai we denote the ball with 
center x and radius r by 5x(a;, r). 

Theorem 10. Go is separable. 

proof. The set of all finite graphs with rational edge weights is a dense subset of ^o- * By 
a finite graph we mean one that has only finitely many non-isolated vertices. Also in a metric 
space we denote the ball with center x and radius r by Bm{x, r). 

Theorem 11. separable. 

proof. The set of all finite graphs with rational edge weights is a dense subset of ^o- * 
Theorem 12. Qq is not locally compact. 

proof. Suppose the contrary, so there is an open neighbourhood Bg^iQ., r) such that Sg;o(0' ^) 
compact. Consider the sequence {Gn} in which Gn is the jointless graph with n edges of weight 
-. This sequence must have a convergent subsequence. The tiny edges of this subsequence 
say that it converges to 0. On the other hand the norm of its members are always equal to r 
implying that the norm of the limit graph must be r, which is a contradiction. ■ 

Lemma 9. Wq is path connected. 

proof. Take the standard graph w. The function / : [0, 1] — > I^q*, f{t) —twisdi, path between 
and so every point is connected to via a path. ■ 

Theorem 13. Qq is path connected. 

proof. The onto function w ^ w from Wq to Qq is continuous and hence takes path connected 
to path connected. ■ 

Theorem 14. Qq is locally path connected. 

proof. First we show that every ball in Wq is path connected. Let w e Wq, r > and 

w' e Bw^{w,r). Choose a finite graph w" from Bw^■j{'w,r) (finite graphs are dense in Wq). 
Bwo{w, r) is convex and the graphs w' and w" have infinitely many isolated vertices in common, 
so we can define the function / : [0,1] Bws(^w,r), f{t) = tw' -|- (1 — t)w" which is a 
path between w' and w". So every member of Bivs(^w,r) is connected to w" via a path, 
therefore the ball is path connected. Now according to the facts that for each w e Wq we have 

Bg^{w,r) — BiYs(^w,r) (which is not difficult to prove) and that the function w ^ w from Wq 
to Qq is continuous, every ball in Qq is path connected and so Qq is locally path connected. ■ 

Acknowledgments. We are grateful to professor Sayyad EbadoUah Mahmoodian, for his 
valuable advices and comments during this work. We also thank all friends that directly and 
indirectly gave us hands in the preparation of this paper. 



13 



References 

[1] Binary codes capable of correcting deletions, insertions, and reversals by: Vladimir I Lev- 
enshtein Soviet Physics Doklady, Vol. 10, No. 8. (1966), pp. 707-710. 

[2] J. Balogh R. Martin, Edit distance and its computation. Electronic Journal of Combina- 
torics, 15(1), Research Paper 20, 27 pp 



14 



